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PRIMER-SPECIFIC AND MISPAIR EXTENSION ASSAY FOR IDENTIFYING GENE 

VARIATION 

Cross-reference to Related Applications: This application claims priority to, and is a 
continuation of, International Application No. PCT/CA99/00733, filed on 9 August 1999, 
designating the United States of America, the contents of which are incorporated by this reference, 
the PCT International Patent Application itself claiming priority from the Canadian Application 
Serial No. 2,245,039 filed 13 August 1998. 

BACKGROUND OF THE INVENTION 

Field of the Invention: The invention relates to primer-specific and mispair extension assay 

for identifying gene variations, such as in different genotypes or subtypes of a given genotype. 

State of the Art: Current genotyping systems are technically complex, time-consuming and 
error-prone in the detection of a single nucleotide variation and low level heterozygotes. 

Despite current genotyping systems, such as restriction fragment length polymorphism 
analysis (RFLP), hybridization, for example, line probe assay, ("LiPA"), selective DNA 
amplification by PCR-type specific primers (Okamoto, H., etal, 7. Gen. Virol 73: 673, 678,1992) 
and direct DNA sequencing, having been useful in general, some technical problems still remain and 
limits their applications. 

Development of viral resistance to antiviral drugs used for treatment of HIV- 1 infection is 
an important cause of treatment failure (Coffin JM., Sciences, 261 \ 483-489, 1995). In addition, drug 
resistant mutations can give rise to selective cross-resistance to other antiviral drugs, which has 
limited the options available for alternative antiviral regimens (Cohen OI, and Fauci AS., N. Engl 
J. Med., 339: 341-243, 1998). Thus genotypic drug resistance testing plays an important role in 
selecting an initial antiviral regimen and changing therapy with alternate regimens as the need arises. 
However, current genotyping assays, including direct DNA sequencing, yield information on only 
the predominant viral 
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quasispecies due to the inability of these assays to 
detect low levels of viral variants (Gunthard HF, et 
al., AIDS Res. Hum. Retroviruses, 14: 869-876, 1998). 
ABI automated sequencer was able to detect mutants at 
5 levels of 10-50% in an artificially mixed sample only 
when the mixes were analyzed by editing the sequences 
manually. Hybridization based high-density 

oligonucleotide arrays (GeneChip) (Deeks SG, Abrams 
DI., Lancet, 349: 1489-1490, 1997) were less sensitive 
10 than the ABI sequencer, . being able to detect mutations 
at levels of 25-75% in the mix (Gunthard HF, et al . , 
AIDS Res. Hum. Retroviruses, 14: 869-876, 1998). 
Similar results were obtained using an automated DNA 
sequencer {Visible Genetics) (Hu YW, et al.. Reliable 
15 detection of mixed HCV genotype infections using a 
novel genotyping assay. 5 ch International Meeting on 
Viral Hepatitis C Virus and Related viruses, Molecular 
Virology and Pathogenesis, Venezia, Italy, 1998) . Line 
Probe Assay (LiPA) , which uses reverse hybridization 
2 0 technology, is relatively rapid and could detect 
mutants at levels as low as 5% (Stuyver L, et al 
Antimicrob. Agents Chemother., 41: 284-291, 1997), but 
again may not be suitable for detection of mixed 
genotypes because it may give no results due to nearby 
25 polymorphism that impair hybridization (Stuyver L, et 
al., Antimicrob. Agents Chemother. , 41: 284-291, 1997).. 
Moreover, in 40% of the samples tested, LiPA failed to 
yield correct results for some of the drug resistance 
mutations (Puchhammer-Stockl E, et al., J. Med. Virol. t 
30 57: 283-289, 1999). Population based sequencing (i.e. 
cloning and sequencing) is the gold standard method for 
detection of minor drug resistance mutants. 
Unfortunately, it is impractical for clinics and large 
cohort studies. 
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For most indirect DNA sequencing genotyping 
systems, a common weakness is that they are not as 
accurate as direct DNA sequencing analysis in 
particular for detection of a single nucleotide 
5 mutation or variation, resulting in considerable 
instances of errors or inconsistent results (Andonov, 
A., et al., J. Clin. Microbiol. 32: 2031-2034,1994); 
(Tuveri, R., et al . , Journal of Medical Virology 51: 
36-41 (1997); Okamoto, H., et al., J. Virol. Methods 

10 57: 31 45 002-16, 1996). Although direct DNA 

sequencing is the most reliable method for genotyping, 
this is not practical for large cohort studies. 

For example, International Publication No. WO 
91/13075 describes a method for detecting variable 

15 nucleotides based on primer extension and incorporation 
of detectable nucleoside triphosphate using T7 
polymerase for extending the primer. However, T7 
polymerase does not have the proofreading activity and 
the 3'-5» exonuclease activity of Pfu resulting in 

20 false positive or false negative reactions. Moreover, 
the method uses ddNTPs for terminating extension. 

Another major limitation of all current 
genotyping systems, including direct DNA sequencing is 
that they can not reliably detect low level of 

25 heterozygotes (Tuveri, R., et al . , Journal of Medical 
Virology 51: 36-41, 1997); (Lau, J.Y., et al . , J . 
Infect. Dis. 171: 281-289, 1995); (Forns, X., et al., 
J. Clin. Microbiology. 34-10: 2516-2521, 1996); or 
mixed genotype infectious (Tuveri, R. , et al . , Journal 

30 of Medical Virology 51: 36-41, 1997); (Lau, J.Y., et 
al., J". Infect. Dis. 171: 281-289, 1995); (Forns, X. , 
et al., J. Clin. Microbiology. 34-10: 2516-2521, 1996). 

International Publication No. WO 96/30545 
discloses a method for simultaneously analyzing a 

35 genetic mutation and a corresponding wild-type sequence 
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within a sample. The method utilizes ddNTPs for 
terminating primer extension. However, the use of 
ddNTPs increases the background of the assay, reducing 
its sensitivity and capacity to detect low levels of 
5 heterozygotes, similarly as in WO 91/13075. 

HCV was recognized as the major etiologic 
agent of blood borne non-A, non-B hepatitis soon after 
the virus was identified in 1989. As an RNA virus, HCV 
shows great genetic variability, resulting in the 

10 existence of types, subtypes and quasispecies . At 
present, 11 types and at least 50 subtypes have been 
described. However, types la, lb, 2a, 2b and 3a have 
been found to be generally the most" prevalent 
(Simmonds, P., Hepatology 21: 570-582, 1995). Subtype 

15 lb is the most common genotype found in Japan (Okamoto, 
H., et al., J. Gen. Virol. 73: 673, 678, 1992) and 
European countries while subtypes la and lb are the 
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most common genotypes in the United States (Lau, J.Y. , 
et al., J\ Infect. Dis. 171: 281-289, 1995) and Canada, 
(Andonov, A. , et al . , J". Clin. Microbiol. 32: 2031- 
2034, 1994). Viruses of various genotypes contain 
5 different antigenic properties, which have potentially 
important consequences for the development of a vaccine 
and for antibody screening tests. Also, the disease 
severity and response to interferon may be influenced 
by the virus types and subtypes (Simmonds, Hepatology 

10 21: 570-582, (1995). Subtype lb was reported to be 
associated with a high severity of the disease and low 
response to interferon (Simmonds, Hepatology 21: 570- 
582, (1995) . It is apparent that a rapid, simple, 
accurate and inexpensive genotyping method is urgently 

15 needed. 

Amplification refractory mutation system (ARMS) 
(Newton, C.R., et al . , Nucl. Acids Res. 17: 2503-2516), 
improved the methods used in the prior art for typing 
the five most common genotypes (Pistello, M. , et al., 

20 J. Clin. Microbiol. 32: 232-234, 1994). ARMS was 
developed for PCR detection of any point mutation in 
DNA using Tag DNA polymerase (Newton, C.R. , et al . , 
Nucl. Acids Res. 17: 2503-2516). It is based on the 
principle that oligonucleotides with a mismatched 3'- 

25 residue would not function as primers in PCR under 
controlled conditions. In some cases, however, the 
specificity of ARMS was not sufficient to give a 
correct diagnosis. The problem with non-specific 
reactivities still remains with the type-specific 

30 primer PCR method for HCV genotyping, even with the 
improvement using ARMS. 

It is apparent that the major cause of the 
nonspecific reactivities found in these assays is 
related to the use of Tag DNA polymerase due to its 

35 lack of 3 1 — >5 * exonuclease activity. This inaccuracy 
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results in base substitutions, transitions, 
tranversions , frame shifts or deletion mutations during 
DNA synthesis. Consequently, mispairs can be 

frequently formed, and Tag polymerase would be able to 
5 continue synthesizing DNA by addition of the next 
correct nucleotide on the template (Lau, J.Y., et al., 
J. Infect. D^s. 171: 281-289, 1995). Even after 
reaching the end of the template, several more 
nucleotides can be added to the extended primer because 
10 most DNA polymerases, including Tag and retroviral 
reverse transcriptase (RT) , have a non- template 
dependent DNA synthesis activity (i.e. terminal 
deoxynucleotide transferase activity) (Clark, J.M. , 
Nucleic Acids Res. 16: 9677; and Patel,.P.H., et al., 
15 Proc. Natl. Acad. Sci. U.S.A. 91: 549-553). Therefore, 
the nonspecific reaction cannot be avoided with either 
ARMS or the methods based on ARMS using Tag DNA 
polymerase . 

Two thermostable DNA polymerases pfu 
20 (pyrococcus furiosus) (Lundberg, K.S., et al., 
Pyrococcus furiousus. Gene 108: 1-6, 1991), and 
TLI/Vent (Thermocococcus litoralis) (Neuner, A., et 
al., larch. Microbiol., 153: 205-207) exhibit 3'->5' 
proofreading exonuclease activity. This ensures a high 
25 degree of amplification fidelity during DNA 
polymerization. Unlike Vent, the pfu 3'-±S< 

exonuclease activity peaks sharply at • its optimal 
polymerization temperature (75°C to 80°C) , minimizing 
undesirable primer-degradation activity (Lundberg, 
30 K.S., et al.. Gene 108: 1-6, 1991). pfu DNA polymerase 
also does not exhibit terminal deoxynucleotidyl- 
transferase (TDT) activity, which was reported to be 
involved in the high mutation rate of DNA during DNA 
polymerization. 
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It would be highly desirable to be provided 
with a simple assay or method to overcome many of these 
limitations of current DNA genotyping systems. 

It would be highly desirable to be provided 
5 with genotypic assays that possess greater accuracy and 
sensitivity for detection of minor drug resistant 
subpopulations of HIV-1 in the early stages of 
r e s i s t anc e e vol u t ion . 

10 SUMMARY OF THE INVENTION 

One aim of the present invention is to provide 
a novel primer specific and mispair extension assay 
(PSMEA) for the determination of genotypes and 
subtypes . 

1 5 Another aim of the present invent ion is to 

provide a primer specific and mispair extension assay 
(PSMEA) to detect nucleotide variations in any known 
gene sequence using* pfu DNA polymerase in the presence 
of an incomplete set of dNTPs, and only a single 

20 primer. 

Another aim of the present invention is to 
provide a tool for reliable detection of mixed genotype 
inf ectius . 

Another aim of the present invention is to 
25 provide a sensitive tool for detection of low levels of 

drug resistant mutants in patients being treated with 

antiviral drugs. 

Another aim of the present invention is to 

provide a tool for accurate genotyping. 
30 In accordance with the present invention, there 

is provided a primer- specif ic and mispair extension 

assay for the determination of genotype. The assay 

comprises the steps of: 

a) extending a nucleic acid sequence from a 

35 patient sample with a polymerase, more preferably a PFU 
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DNA polymerase, using a primer specific for a genotype 
to be determined and an incomplete set of dNTPs, under 
suitable conditions for obtaining extension products of 
the primer, whereby at least one of the primer or the 
5 dNTPs is labeled; 

b) characterizing the extension products; and 

c) comparing the extension products with known 
nucleic acid sequences of various genotypes for 
determining the genotype of the nucleic acid sequence 

10 extended. 

Preferably, the step of characterizing the 
extension products comprises the step of separating by 
size said extension products. More preferably, the 
step of characterizing the extension products further 

15 comprises after the step of separating by size the 
extension products the step of sequencing the extension 
products . 

Alternatively, the assay may further comprise 
before step a) the step of amplifying the nucleic acid 
2 0 sequence. 

Preferably, the incomplete set of dNTPs 
contains two or three different types of nucleotides. 

In a preferred embodiment, the primer is 
labeled with a radioactive label or a fluorescent 
25 label. 

In another embodiment, one of the dNTPs is 
labeled with a radioactive label or a fluorescent 
label . 

In accordance with the present invention, there 
30 is provided a novel genotyping system, primer specific 
and mispair extension analysis {PSMEA) using preferably 
the unique 3 1 to 5 1 exonuclease proofreading properties 
of pfu DNA polymerase with an incomplete set of dNTPs 
and only a single primer. 
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Also in accordance with one embodiment of the 
present invention there is provided a primer- specif ic 
and mispair extension assay for the determination of 
genotype and detection of low level mixed genotype 
5 injections or heterozygotes . The assay comprises the 
steps of: 

a) extending a DNA sequence amplified from a 
patient sample with PFU DNA polymerase using a primer 
specific for a genotype to be determined and an 

10 incomplete set of dNTPs, under suitable conditions for 
obtaining extension of the primer, whereby at least one 
of the primer or one of the dNTPs is labeled; 

b) separating the extended DNA sequences obtained 
in step a) ;and 

15 c) detecting the separated extended DNA sequences; 

and 

d) comparing the extended DNA sequences with known 

DNA sequences of various genotypes for determining the 
genotype of the DNA sequences extended. 

2 0 The primer may be end- labeled with a label or 

one of the dNTPs can be labeled with a label. The 
label can be a radioactive or fluorescent label. 

Preferably, the steps a) , b) , c) , and d) as 
described above are automated. 

25 

BRXEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 illustrates characteristics of primer 
specific and mispair extension by pfu and Taq. DNA 
polymerase; 

3 0 Fig. 2 illustrates nucleotide deletions and 

insertions by PSMEA with primer 6AR-1 and 6AR-2 
extensions by pfu on templates la and 6a) ; 

Fig. 3 illustrates a comparison of the 
sensitivity between PSMEA and direct DNA sequencing in 
35 detection of mixed genotypes; 



WO 00/09745 .ji. PCT/CA99/00733 

Fig. 4 illustrates a typical profile of 
unlabelled primer extensions by pfu on HCV templates 
for types la, lb, 2a, 2b, 3a and 3b using 32 P-labeled 
dCTP and dGTP; 

5 Figs. 5A and 5B illustrate the detection of 

HIV-l drug resistant mutant V82A and wild type V82 
using a universal primer and four sets of dNTPs ; 

Figs. 6A and 6B illustrates the detection of 
HIV-l drug resistant mutant L63P and wild type L63 
10 using matched and mismatched pairs at the 3' end of the 
primers in PSMEA; 

Figs. 7A and 7B illustrates the detection of 
HIV-l drug resistant L90M and wild type L90 using 
matched and mismatched pairs at the 3' end of the 
15 primers; and 

Fig. 8 illustrates the sensitivity of PSMEA for 
detecting low levels of HIV-l drug resistant mutant 
with a defined mix of mutant V82I and wild type V82. 

20 DETAILED DESCRIPTION OF THE INVENTION 

In accordance with one embodiment of the 
invention, there is provided a novel primer specific 
and mispair extension assay (PSMEA) for the 
determination of genotypes, and more preferably 

25 Hepatitis C virus (HCV) genotypes and subtypes. 

PSMEA is used to detect nucleotide variations 
in any known gene sequence using pfu DNA polymerase in 
the presence of an incomplete set of dNTPs . To test 
the feasibility of PSMEA, the 5» UR of the HCV genome, 

30 known for being highly conservative, was used as a 
model for analysis of the nucleotide variation in 
determining the type and subtype of the virus. The 
results obtained demonstrate that PSMEA is a rapid, 
simple and accurate method for HCV genotyping. 
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In the 5 ? UR of the HCV genome, six major 
genotypes and some subtypes can be classified by the 
nucleotide variation in this region (Simmonds, P., 
Hepatology 21: 570-582, 1995); (Stuyver, L., et al . , 
5 J. Clin. Microbiol. 34: 2259-2266, 1996). 

The PSMEA of the present invention is a simple 
method with great potential in accurately detecting 
nucleotide mutations and which may be used for 
detecting nucleotide variations in any known gene 
10 sequence. 

PSMEA is based on the unique properties of 
3 f -»5 ! proofreading activity in a reaction with 
incomplete set of dNTPs . Under such reaction condition 
in accordance with one embodiment of the invention, pfu 
15 is extremely discriminative in nucleotide incorporation 
and proofreading at the initiation step of DNA 
synthesis, allowing for an accurate detection of 
nucleotide variation and heterozygotes in PSMEA, 

It' is known that under this reaction condition, 
20 mispair formation and extension occur during DNA 
synthesis when using reverse transcriptase and DNA 
polymerases, including enzymes that exhibit 3 ! -»5' 
proofreading exonuclease activity (Perinno, P., et al., 
Proc. Natl. Acad. Sci . USA. 86: 8343-8347, 1989); 
25 (Reha-Krantz, L.J., et al . , Proc. Natl. Acad. Sci. USA. 
88: 2417-2421, 1991). However,, the frequency of 
mispair formation and extension depends on whether the 
polymerase possesses 3 , ->5 l exonuclease activity, the 
concentration of nucleotide substrates and the 
30 composition of the mispairs (Reha-Krantz, et al . , Proc. 
Natl. Acad. Sci. USA. 88: 2417-2421, 1991) . 
Accordingly, the characteristics of primer specific and 
mispair extension by thermostable DNA polymerases 
including pfu {Pyrococcus furiosus) (Lundberg, K.S., et 
35 al., Gene 108: 1-6, 1991), Tag (Thermus aquaticus) , 
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(Saiki, R.K., et al., Science, 239: 487, 1988), and 
TLI/Vent (Thermocococcus litoralis) (Neuner, A., et 
al., larch. Microbiol., 153: 205-207) were further 
investigated. Several characteristics of primer 
5 specific and mispair extension by pfu were observed and 
found to be useful for reliable detection of nucleotide 
variation, deletion and insertion as well as 
heterozygotes . 

* In accordance with a preferred embodiment of 

10 the invention, the accuracy was evaluated by assaying 
the nucleotide variations between genotypes in the 5 ' 
untranslated region (5* UR) of the hepatitis C virus 
(HCV) genome. Mixed infections with more than one 
genotype of HCV were used for comparison of the 

15 sensitivity of PSMEA with other assays in the detection 
of heterozygotes. The feasibility of the method of the 
present invention for large cohort studies was 
demonstrated by genotyping a total of two hundred and 
forty five (245) HCV isolates. The results show that 

20 PSMEA is an extraordinary accurate system to identify 
nucleotide variation for genotyping and detecting 
heterozygous molecules, readily applicable to routine 
use. 

The design of PSMEA in accordance with a 
25 preferred embodiment of the invention is based on a 
single primer extension by the pfu DNA polymerase in 
the presence of only dGTP and dGTP, permitting accurate 
detection of nucleotide variations in the 5 1 
untranslated region (5'UR) of the HCV genome. In 
3 0 accordance with one embodiment of the invention, the 
HCV genotypes from ninety- six (96) patients and blood 
donors with HCV infection were determined by PSMEA. 
Seventy- four (74) of the samples were also genotyped by 
either the line probe assay (LiPA) or restriction 
35 fragment length polymorphism (RFLP) methods. Genotypes 
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were confirmed by nucleotide sequencing as required. 
HCV Isolates, including types and subtypes la, lb, 2a, 
2b, 2c, 3a, 3b, 4a, 5a, and 6a, were clearly identified 
by the PSMEA of the present invention. All of the 

5 types and subtypes determined by PSMEA were matched 
with those identified by Li PA or RFLP. Five (5) 
isolates of subtypes la and lb that could not be typed 
by Li PA were clearly identified by the PSMEA of the 
present invention. 

0 The primers used in the present invention were 

designed to meet the following requirements: 1) the 
sequence of primer binding site on template should be a 
type or subtype specific; 2) primers used for PSMEA 
should exhibit a similar melting temperature. In 

5 accordance with the present invention, 11 primers were 
designed for HCV genotyping with PSMEA (Table 1) . 
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Fig. 1 illustrates preferred characteristics of 
primer specific and mispair extension by pfu. 

In Fig* 1, I illustrates 32 P- labeled primer 
extensions by pfu on template lb in the presence of 
5 dCTP and dGTP {Fig, 1, 1-1 and 1, 1-2} used as markers 
for the length of primers and extended products. II 
illustrates the use of 32 P- labeled dNTPs instead of 
labeled primers for PSMEA under the same reaction 
conditions/ showing the difference in primer extension 

10 between genotypes la and lb (Fig. 1, II- 1,- II-2, II-3 
and II-4) . Ill illustrates mispair formation and 
extension by Taq on template la with 32 P- labeled 
primers 1AB and 1BR (Fig. 1, III-l and III-2) . I», II* 
and III 1 illustrate the nucleotide sequences extended 

15 on templates la and lb. Primer 1AE extended on the 
antisense strand of templates la and lb. X, XX and XXX 
represent the sites of nucleotide mismatches that 
terminated primer extension. -» represents a 

nucleotide at the 3 1 end of the primer that is 

20 complementary to the opposite nucleotide in the 
template. (A), (C) , (G) or (T) denotes the position of 
the nucleotide when a mispair is produced. -113, -108 
and -99 are the nucleotide positions in 5 ! UR of HCV. 
The underlined sequence indicates the primer binding 

25 sites. The signs in this figure are used for all other 
figures, 

A mismatched pair with nucleotide A at position 
-99 of the template HCV genotype la, that is opposite 
the first nucleotide to be incorporated at the 3 ' end 

3 0 of primer could not be produced, and subsequently 
aborted primer extension by pfu in the presence of dCTP 
and dGTP (Fig. 1, 1-1) . Under the same reaction 
conditions, the primer extended on the template of 
genotype lb because nucleotide A becomes G at position 

35 -99 (Fig. 1, 1-2), differentiating between genotypes la 
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and lb with this single nucleotide variation that is 
the only difference in the 5' UR between the two 
genotypes . 

FIG. 1 illustrates the alignment of the 5' UR 
nucleotide sequences of HCV subtypes la and lb, showing 

the homologous sequence ( ) and the difference of the 

nucleotide at position of -99 (A for la and G for lb) . 
The primer extension stopped at position of -99 where a 
mismatched pair(s) exists (X) in. The arrow (-» 
represents a nucleotide at 3- end of the primers that 
is complementary to the opposite nucleotide in the 
template. The parenthesis ( ) denotes ' that a 
nucleotide mispair occurred. 

The absence of mispair formation and extension 
15 at position -99 is due to pfu 3'->5' exonuclease 
proofreading activity that sharply peaks up at its 
optimal polymerization temperature, removing the 
mismatched nucleotide added at the initiation step of 
primer extension. It is apparently different from the 
20 3>-»5' exonuclease proofreading activity of other DNA 
polymerases such as pol-a and T4 in the primer mispair 
_ extension reaction. The mispair at the initiation site 
of DNA synthesizing can be produced with high frequency 
by these DNA polymerases in the primer mispair 
25 formation and extension reaction (Perinno, F., et al., 
Proc. Natl. Acad. Sci . USA. 86:8343-8347, 1989), 
(Reha-Krantz, L.J., Set al., Proc. Natl. Acad. Sci. 
USA. 88:2417-2421, 1991) . 

Taq DNA polymerase without 3<->S< exonuclease 
30 proofreading activities is used in virtually all PCR 
based genotyping assays (Okamoto, H., et al . , J". Gen. 
Virol. 73: 673, 678,(1992), (Newton, C.R. , et al . , 
Nucl. Acids Res., 17: 2503-2516). The unreliability 
caused by cross reactivity and wrong priming has been a 
35 big concern (Lau, J.Y., et al . , J. Infect. Dis. 171: 
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281-289, 1995), and Forns, X., et al . , J. Clin. 
Microbiology. 34-10: 2516-2521, 1996). When Tag is 
used instead of pfu, with the other conditions of the 
assay being the same, the primer 1BR extension took 
5 place on template la despite a mismatched pair existing 
at primer extension initiation site (-99)" (Fig. l-III- 
2) • This clearly indicates that the major cause of 
these nonspecific reactivities is due to the infidelity 
of Taq during the initiation of DNA synthesis (Perinno, 
10 F., et al., Proc. Natl. Acad. Sci. USA. 86: 8343-8347, 
1989) . 

In addition, since most DNA polymerases 
including Taq and reverse transcriptase of retroviruses 
exhibit a non-template dependent DNA synthesis activity 
15 (i.e. terminal deoxynucleotide transferase activity, 
IDT) , several more nucleotides can be added to the 3 T 
end of the primer (i.e. single strand DNA) (Clark, 
J.M., Nucleic Acids Res. 16: 9677), (Patel, P.H., et 
al., Proc. Natl. Acad. Sci. U.S.A. 91: 549-553). Thus 
20 Taq would be able to continue synthesizing DNA by 
addition of a correct paired nucleotide next to the 
mispaired nucleotide, in particular to the nucleotide 
in the A:C mispair that can be formed and extended more 
efficiently than other mispairs (Perinno, F., et al . , 
25 Proc. Natl. Acad. Sci. USA. 86: 8343-8347, 1989), 
(Newton, C.R., et al., Nucl. Acids Res. 17: 2503- 
2516) . A DNA strand with mismatched nucleotides would 
be generated and then act as a template that would be 
subsequently re-amplified, generating a large number of 
30 non-specific DNA molecules. Therefore, the nonspecific 
reaction can not be completely avoided with the methods 
based on DNA amplification with primer-specific PCR 
using Tag DNA polymerase. In contrast, in PSMEA, only 
a single primer is used, so that the extended primer 
35 cannot be re-amplified, and any mispairs, including A:C 
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in front of the 3 ' end of primer could completely stop 
primer extension. TLI/Vent DNA polymerase exhibits 
3»-»5» proofreading exonuclease activity, however, 
unlike pfu, TLI/Vent severely degraded single stranded 
5 DNA (i.e. primer) in PSMEA. 

In Pigs. 1, 1-1/ primer extension was performed 
using pfu with labeled primers (1-1) , pfu with labeled 
dNTPs (l-II), and Taq with labeled primers (l-III) . 
The symbol f represents where the nucleotide 

10 incorporation starts. The symbol (X) denotes a stop of 
primer extension caused by the nucleotide mispair(s). 

The second feature of specific and mispair 
extension by pfu is that one correct pairing followed 
by more than one mispair in front of the . 3 1 end of the 

15 primer is not sufficient for primer extension (Figs. 2, 
1-1 and 1-2) , at least two consecutive correct pairings 
are required (Fig. 2, 1-4 and II-2) . This provides a 
means to identify nucleotide deletion and insertion as 
well as multiple nucleotide variations or mutations in 

20 PSMEA. 

In Fig. 2, Primer 6AR-1 could not be extended 
with template la when using either dCTP plus dTTP, or 
dCTP plus dGTP as substrates due to a CA deletion ( . . ) , 
resulting either in a mispair at nucleotide position 

25 -145 in front of the 3' end of primer (1-1) or in only 
one matched pairing at the position. (1-2) . Primer 6AR- 
1 could not be extended on template 6a when using dCTP 
and dGTP (1-3). However, the primer was extended by 
three bases with template 6a using dGTP and dTTP which 

30 matched the nucleotides in the CA insertion (CA) in 
template 6a (1-4) . Primer 6AR-2 was extended when 
using template 6a (II-l and II-2) , but not with 
template la using either dATP, dCTP and dGTP or only 
dGTP (II-3 and II-4) . The symbol <r> denotes the 
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removal of the first nucleotide mismatched at the 3 1 
end of the primer. 

As seen in Fig. 2, there is a unique CA 
insertion in the 5 1 UR of HCV genotype 6a. Use of the 
5 CA insertion can differentiate between 6a and other 
genotypes. In the instant application, the CA 
insertion is used as an example to show how nucleotide 
deletion and insertion could be identified. Based on 
the flanking nucleotide sequence of the CA insertion, 
10 two primers were designed. A first primer 6AR-1, 
located in the region before a CA insertion in HCV 
genotype 6a was extended only when using the correctly 
paired dNTPs (i.e. dGTP and dTTP) (Fig. 2, 1-4), but not 
those dNTPs (dCTP and dGTP) (Fig. 2, 1-3) that 
15 mismatched with the nucleotide A in the CA insertion. 
The second primer 6AR-2 was designed with the first 
nucleotide at its 3 l end being matched with the first 
nucleotide A in the CA insertion of template 6a. Thus 
the primer extended two bases or 11 bases on template 
20 6a, depending on the substrates of dNTPs used (Fig. 2, 
II -1 and II -2) , but not on template la due to a CA 
deletion that results in a 3 1 mispaired residue being 
removed after primer binding (Fig. 2, II -3 and II -4) . 

In accordance with the present invention, 
25 multiple nucleotide variation or mutation can be 
identified. For example, a unique TCA motif that is 
specific to genotype 3a and 3b could be identified and 
is used for differentiation between the two genotypes 
and other genotypes. 
30 Like mispair formation and extension by T4 DNA 

polymerase (Wilber, J. C, et al., M. S. Reverse 
transcript ase-PCR for hepatitis C virus RNA, p. 327- 
331. In D. H. Persing, T.F. Smith, F.C. Tenover, and 
T.J. White (ed,), Diagnostic Molecular Microbiology: 
35 Principles and Applications. American Society for 
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Microbiology, Washington D.C. 1993) , any two or more 
consecutive mispairs could completely terminate primer 
extension by pfu because of its 3 ! — >5 1 proofreading 
activity (Figs. 1, II-2 and 2, 1-3 and II-2) . It was 
5 also found that two or more mispairs, but separated 
with one or two correct base pairs {Figs. 1, II-l and 
2, 11*1) also could terminate primer extension by pfu. 
Use of the termination point caused by these mispairs 
as well as primer specific and mispair extensions on 
10 templates by pfu provided reliable information on 
nucleotide sequence in the given region of the 5 ' UR of 
HCV. 

The highest molecular weight band shown on the 
sequencing gels represents the longest sequence 

15 extended to the termination point that is specific to 
each of the HCV genotypes. As indicated in Fig. 1, II , 
for example, the highest band represents the primer 1AB 
extended to the nucleotide C at nucleotide position 
-100 before the termination point at position -99 and 

20 -98 that would produce two mispairs with the two 
adjacent nucleotides A in template la, while the 
highest band represents the primer extended to the 
nucleotide G at -99 before the termination point at 
position -98 and -96 in template lb, showing the single 

25 nucleotide difference between la and lb. This is 
another way to identify a single point mutation or 
variation in PSMEA in accordance with the present 
invention. 

Whatever dNTPs (i.e. one, two or three of the 
30 four dNTPs) are chosen, they must follow the 
"instruction" with the characteristics of primer 
specific and mispair extension by pfu. 

The nucleotide incorporation rate of pfu is one 
fifth (1/5) of that of Tag. Thus pfu-based PGR 
35 applications require a minimum extension time of 2.0 
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minutes/kb. The efficiency of nucleotide incorporation 
by pfu is high enough for PSMEA in which only less than 
20 bases of extension are required per cycle in the 
reaction. Over 50% of the excess primer with less 
5 template (molecular ratio of primer and template 10:1) 
could be extended in a 20 cycle reaction, generating 
strong signals with either 32 P-labeled primer or dNTPs. 
Thus, PSMEA offers not only an advantage of superior 
accuracy over current indirect DNA genotyping systems, 
10 but an extraordinary sensitivity for detection of mixed 
infections with different genotypes of HCV. 

Direct DNA sequencing is routinely used as the 
gold standard method for confirmation of the results 
from other assays (Tuveri, R. , et al . , Journal of 
15 Medical Virology 51: 36-41, 1997), (Forns, X., et al., 
J. Clin. Microbiology. 34-10: 2516-2521, 1996). In the 
present application, the sensitivity of PSMEA for 
identifying mixed infections was evaluated with direct 
DNA sequencing methods. In an experiment, the cDNAs 
20 (PCR products) from genotypes lb and 2a isolates were 
mixed in different proportions to mimic HCV mixed 
infections and heterozygotes . Fig. 3 shows that 
genotype 2a could be clearly identified by direct DNA 
sequencing only when it reached to 50% in proportion in 
25 the mix. When 2a molecules consisted of less than 25%, 
only some of nucleotide variation points could be 
manually recognized, but was not conclusive for the 
identification of which genotype it was. However, 2a 
consisting as low as about 3% in the mix was clearly 
3 0 detected by PSMEA, showing approximately 10 times more 
sensitivity than the direct DNA sequencing system. 

To further confirm the sensitivity of PSMEA, 
three samples identified to contain two or three 
genotypes by PSMEA were analyzed by direct DNA 
35 sequencing (Fig. 3, B) and the reverse hybridization 



WO 00/09745 PCT/CA99/00733 

-22- 

method. Results reveal that the presence of 2b was 
confirmed by the direct DNA sequencing method, but not 
by reverse hybridization method in the sample 
containing la as dominant population . 
5 Fig. 3 presents autoradiography results in A 

and the date of a computer analysis of automated 
sequencing in A'. Different proportions of genotypes 
lb and 2a: 0% to 100% (I), 50% to 50% (II), 75% to 25% 
(III), 87.5% to 12.5% (IV), 93.75% to 6.25% (V), and 
10 96.875% to 3.125% (IV) in the mix were analyzed with 
PSMEA (A) and direct DNA sequencing (A 1 ). Two 
genotypes, la and 2b, were identified in a thalassaemia 
patient sample by PSMEA (B) . N in B represents the 
negative reaction with primer 6AR and the sample from 
15 the thalassemia patient. 

To evaluate the feasibility of PSMEA for large 
cohort studies, a total of two hundred and forty five 
(245) samples from HCV seropositive blood donors and 
patients with chronic hepatitis were genotyped by this 
20 assay. The genotypes determined by PSMEA included la, 
lb, 2a, 2b, 2c, 3a , 3b, 4, 5a and 6a. The typeable 
rate of these samples with PSMEA was 95.5%. Eighty 
(80) of them were also typed with other indirect or 
direct DNA sequencing genotyping methods. The 
2 5 genotyping results from PSMEA were in 90-100% 
concordance with that from other genotyping methods 
including LiPA , RFLP and direct DNA sequencing. Less 
than 5% of untypeable samples were sequenced, 
indicating that those isolates being untypeable by 
30 PSMEA were either HCV mutants or unclassified 
genotypes. Results have proven a great utility of 
PSMEA for large cohort studies on viral genotyping. 

PSMEA has been further developed with an 
automatic and colorimetric format, creating a great 
35 capacity to quickly genotype a large number of HCV 
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isolates not previously possible. Therefore, PSMEA has 
a great potential for the identification of nucleotide 
variations and heterozygotes in many areas such as 
virology, bacteriology , human genetics, epidemiology 
5 and legal medicine. 

When a type or subtype -specific sequence is 
available for designing the primer, for example primers 
2A and 3A, no cross reativities with other genotypes 
are observed in PSMEA- Consistent results were 
10 obtained with primers 1AB, 2B, 2C, and 3R (Table 2) . 
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Primer 1AB is universal for genotypes la and lb 
(Fig. 1; III}/ differentiating the two genotypes from 
other genotypes except genotype 6a due to the 
homologous sequence in the region -131 and -99 between 
5 genotypes la/lb and 6a. Fortunately, a nucleotide A in 
the unique CA insertion between the nucleotide 
positions -145 and -144 in the 5' UR of 6a could be 
used as first paired nucleotide at the 3 f end of primer 
6AR, thus the primer specifically extended on template 
10 6a, but not other genotypes (Fig. 2, I) . 

The primer could be bound on other genotypes, 
but could not be extended due to the CA deletion in 
these genotypes versus the insertion in 6a 5'DR. It 
was thus found that, for example, the primer 6AR-1 
15 could not be extended because there a single C at 
position -145 which could be paired in presence in the 
presence of dTTP and dGTP, followed by three Ts in the 
template la, in the reaction {Fig. 2, 1-1) . However, 
the primer extended on template 6a in the same reaction 
20 conditions due to the presence of the two consecutive 
pairings C:G and A:T at positions between -145 and -144 
(Fig. 2, 1-4). Accordingly, the primer 6AR2 could be 
extended longer on template 6a when using three dNTPs 
(dATP, dGTP and dCTP) , showing a stronger signal to 
25 differentiate 6a from other genotypes (Fig. 2, Il-l) . 
In the same manner, primer 5AR designed with a 
nucleotide variation at -236 was subtype specific. 

As shown in Table 2, primer 3B that was 
originally designed for identification of genotype 3b 
30 exhibited cross reactivity with 4e in PSMEA due to the 
homologous sequence in the region -175 to -14 9 chosen 
between the two genotypes. Thus a small region (-99 to 
-79) that contains an unique TCA motif in 5 ! UR of 
genotypes 3a and 3b was used for designing the primer 
35 3R. Thus by using primers 3A and 3R, genotypes 3a and 
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3b could be differentiated from other genotypes. 
Similarly, three genotypes la, lb and 6a could clearly 
be identified with primer 1AB , 1BR and 6AR that 
exhibited a cross reactivity with 1AB . A total 11 
5 genotypes including six major genotypes (la, lb, 2a, 
2b, 3a, 3b) and some of uncommon genotypes (2c , 5a, 6a, 
4a and 4e) in Canada could be identified by PSMEA using 
11 primers (Table 2) . 

Extension of the mismatched 3 1 termini of DNA 
10 is a major determinant of the infidelity of the DNA 
polymerases that have no 3 '-^5' exonuclease activity, 
(Pistello, M. et al., J. Clin. Microbiol. 32: 232-234). 
With the PSMEA of the present invention, by using 
3 , ->5 l exonuclease proofreading activity, high 
15 detection specificity for nucleotide mutation or 
variation in a known gene has been achieved with the 
PSMEA of the present invention. A big advantage of 
this assay is that a single nucleotide variation, 
deletion and insertion can be accurately detected. In 

2 0 the genome of many natural virus mutants or drug 

resistant mutants there may be only a single nucleotide 
mutation that has potential genetic- or clinical 
significance. For example a substitution C for A at 
nucleotide position 1814 that destroys the precore 
25 initiation codon, will prevent production of HBVeAg . 
In some of drug resistant mutants, a single nucleotide 
mutation could cause a failure of an antiviral therapy. 
PSMEA of the present invention can thus be used for 
rapid screening of those mutants. 

3 0 A single point mutation could be associated 

with genetic disease in human such as a single point 
mutation resulting in an amino acid substitution 
(C282Y) in the gene, HLA-H for haemochromatosis , which 
was reported to be involved in iron metabolism 
35 disorder. Such single point mutation are frequently 
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found in viruses. For example, a substitution C for A 
results in loss of precore initiation codon of 
hepatitis B virus, preventing e-antigen (HBeAg) 
synthesis , 

5 The PSMEA of the present invention can be 

modified so as to replace radioactivity by a detectable 
label. Such a non- radioactive assay could be in the 
form of a colorimetric PSMEA in a microtiter-plate 
format . 

10 The present invention will be more readily un- 

derstood by referring to the following examples, which 
are given to illustrate the invention rather than to 
limit its scope. 

15 EXAMPLE X 

DETERMINATION OF HCV GENOTYPES 

Primer extension reactions contained 2 0 ng 

primer, 20-30 ng PCR product, 20 mM of each dCTP and 

dGTP, 10 mCi of each 32 p-labeled dCTP and dGTP, 1.25 

2 0 units pfu DNA polymerase and 10 ml 10X pfu reaction 

buffer (PDI) . When 5 1 -end 32 p-labeled primers were 

used, the 32 p-labeled dCTP and dGTP were omitted, and 

100 fiM of each dCTP and dGTP were used in the 

reactions. The primer extensions were performed in a 

2 5 reaction volume of 100 ml in a thermocycler (Perkin 

Elmer, GeneAmp 9600) . Twenty (2 0) cycles consisting of 
a denaturation step at 94 °C for 20 seconds, an 
annealing step at 64 °C for 20 seconds and an extension 
step at 72 °C for 35 seconds were performed. One 

3 0 microliter of the primer extension products was mixed 

with 1 fih of the sequencing stop solution (Pharmacia 
Biotech) and electrophoresed on 8% po lya cry 1 amide 8M 
urea TBE gels for one hour. Extension products were 
visualized by autoradiography. 
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For PCR amplification and sequence analysis, 
viral RNA was isolated from 100 ml of serum by- 
treatment with RNAzol B (Biotecx Laboratories, Houston, 
Tex.) as previously described in Wilber, J. C, 
5 Johnson, P. J., and Urdea, M. S. Reverse transcriptase- 
PCR for hepatitis C virus RNA, p. 327-331, In D. H. 
Persing, T.F. Smith, F.C. Tenover, and T.J. White 
(ed.), Diagnostic Molecular Microbiology: Principles 
and Applications. American Society for Microbiology, 
10 Washington D.C. 1993. RT-PCR was performed as 
described by Bernier et al (Bernier, L., et al . , J. 
Clin. Microbiol. 34: 2815-1818), with a set of primers 
that target highly conserved domains within the S'-UR. 
The nucleotides and primers were removed from the PCR 
15 products with the QIAquick™ PCR Purification Kit 
(QIAGEN) , using the procedure recommended by the 
product supplier. These purified PCR products were 
used for primer extension and automated sequencing 
analysis . 

20 HCV isolates from patients with HCV infection 

were typed either by the improved Inno Li PA kit II™ 
using the procedure provided by the supplier 
(Innogenetics N.V., Belgium) or by RFLP analysis. For 
RFLP analysis, HCV genotypes were determined by 

25 cleavage of the PCR products with restriction enzymes 
BstNI , Bsr, Hinfl, Maelll, Haelll, BstUl and ScrFl . 
Digests were analyzed by gel electrophoresis and 
ethidium bromide staining. 

30 EXAMPLE II 

PRIMER DESIGN AND PSMEA PROCEDURE 

In accordance with a preferred embodiment of 

the invention, the development of PSMEA is based on a 

single primer extension in the 5 ■ -UR of the HCV genome 

35 using pfu DNA polymerase in the presence of only dCTP 
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and dGTP in the reaction. Thus, under these reaction 
conditions, primer extension occurs when only the G 
and/or C nucleotides are added immediately downstream 
of the 3 ' end of the primer, since the incorporation of 
5 A or T nucleotide would in many instances be prevented 
by the 3' -5* exonuclease activity of pfu. The primers 
were designed to meet the following requirements: 

1. They must have type-specific sequences; and 

2. All primers used for PSMEA should exhibit a 
10 similar melting temperature . 

The nucleotide sequences of the primers used in 
this study are shown in Table 1. In the PSMEA 
procedure, the primer extension reactions contained 20 
ng primer, 20-30 ng PGR product, 20 mM of each dCTP and 

15 dGTP, 10 mCi of each 32 P-labeled dCTP and dGTP, 1.25 
units pfu DNA polymerase and 10 ml 10X pfu reaction 
buffer (Stratagene) . When 5 f -end 32 P~labeled primers 
were used, the 32 P-labeled dCTP and dGTP were omitted, 
and 100 /xM of each dCTP and dGTP were used in the 

20 reactions. The primer extensions were performed in a 
reaction volume of 100 ml in a thermocycler (Perkin 
Elmer, GeneAmp 9600) . Twenty cycles of 94 °C for 20 
seconds, 64 °C for 20 seconds and 72 °C for 35 seconds 
were performed. One microliter of the primer extension 

25 products were mixed with 1 fih of the sequencing stop 
solution (Pharmacia Biotech) and elect rophoresed on 8% 
polya cry 1 amide 8M urea TBE gels. Extension products 
were visualized by autoradiography. 

30 EXAMPLE III 

PCR AMPLIFICATION AND SEQUENCE ANALYSIS 

Viral RNA was isolated from 100 ml of serum by 

treatment with RNAzol B (Biotecx Laboratories, Houston, 

Tex.) as previously described in Wilber, J. C. , 

35 Johnson, P. J., and Urdea, M. S. Reverse transcriptase- 
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PCR for hepatitis C virus RNA, p. 327-331. In D. H. 
Persing, T.F. Smith, F.C. Tenover, and T.J, White 
(ed.), Diagnostic Molecular Microbiology: Principles 
and Applications. American Society for Microbiology, 
5 Washington D.C. 1993.. RT-PCR was performed' as 
described by Bernier et al. Bernier, L., et al., J. 
Clin. Microbiol. 34: 2815-1818, with a set of primers 
(see Table 1) that target highly conserved domains 
within the S'-UR. If a second round PCR was necessary 

10 to provide sufficient cDNA for PSMEA, a pair of nested 
primers (sense primer -211 to -192 and antisense primer 
-91 to -74) was used. The nucleotides and primers were 
removed from the PCR products with the QIAquick™ PCR 
Purification Kit (QIAGEN) , using the procedure 

15 recommended by the manufacturer. These purified PCR 
products were used for PSMEA and for automated 
sequencing analysis. 

EXAMPLE XV 

2 0 GENOTYPING OP HCV ISOLATES BY LiPA AND RFLP 

HCV isolates from patients with HCV infection 
were typed either by the improved Inno LiPA kit II™ 
using the procedure provided by the supplier 
(Innogenetics N.V., Belgium) or by RFLP analysis. For 
25 RFLP analysis, HCV genotypes were determined by 
cleavage of the PCR products with restriction enzymes 
BstNI, Bsr, Hinfl, Maelll, Haelll, BstUl and ScrFl, 
(Andonov, A., et al . , J\ Clin. Microbiol. 32: 2031- 
2034, 1994). Digests were analyzed by gel 

3 0 electrophoresis and ethidium bromide staining. 

EXAMPLE V 

The accuracy and reliability of PSMEA 

To evaluate the accuracy and reliability of 
3 5 PSMEA, 51 HCV isolates from HCV infected individuals in 
Montreal, Canada, typed by RFLP analysis in the 5 1 UR 
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(Andonov, A., et al . , J". Clin. Microbiol. 32: 2031- 
2034, 1994) were analyzed with PSMEA in a double blind 
study. The subtypes 2a and 2c were grouped together 
when RFLP was used for genotyping of the 51 isolates, 
5 since some 2c variants share common ScrFI cleavage 
sites with 2a variants. 

A primer designed with sequence -133 to -113 
from this isolate has shown no cross reactivity with 
other types described in the present study (Table 2) , 

10 suggesting that most isolates determined as 2a/2c by 
RFLP have a typical 2a specific sequence, and that the 
primer 2A can discriminate the majority of subtype 2a 
from subtype 2c. The results from the analysis of 
these 51 isolates indicate that 100% of types and 

15 subtypes for the isolates determined by PSMEA were 
matched with the types and subtypes identified with 
RFLP. In these isolates, there were 14 -la, 14 -lb, 5- 
2a, 5-2b, l-2c, 10-3a, l-3b, and l-6a. Li PA was 
reported to reliably type the most common genotypes, 

20 including some subtypes (Simmonds, P., Variability of 
Hepatitis C. Virus. Hepatology 21: 570-582, 1995)* 
Fifteen (15) isolates were typed by Li PA and evaluated 
with PSMEA in accordance with the present invention. 
They include 5-la, 6-lb, l-2a, l-2b, l-3a, l-3b. The 

25 results from PSMEA showed a 100% agreement with that 
from LiPA. 

A group of five (5) HCV isolates that failed to 
be typed or subtyped by other methods were clearly 
identified as la or lb by PSMEA. The results were 
30 confirmed by direct DNA sequencing using their PCR 
products from the 5' UR region (-211 to -71) {Fig. 4), 
suggesting that results from PSMEA is reliable. 

In the present application, the majority of the 
samples were typed with PSMEA using non-labeled primers 
35 and 32 p-labeled dNTPs. Results obtained using 32 p- 
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labeled dWTPs with unlabelled primers showed the 
typical patterns of the primer extensions as seen in 
the results from the reaction with 32 p- labeled primers 
and non-labeled dNTPs . 
5 Accordingly the new genotyping assay, primer 

specific and mispair extension assay (PSMEA) of the 
present invention was used to genotype HCV and to 
detect mixed infections . A total of one hundred and 
forty six (146) HCV isolates were typed and analyzed 

10 with PSMEA, showing that nine of 110 isolates (8-2%) 
from HCV positive blood donors and six of 3 6 isolates 
(16.7%) from thalassaemia patients were found to 
contain more than one genotypes. The results were 
confirmed and compared with other current assays 

15 including direct DNA sequencing and line probe assay 
(LiPA) . PSMEA of the present invention was found to be 
more reliable than other assays in detecting mixed 
infection. 

20 EXAMPLE VI 

Feasibility of PSMEA for large cohort studies 

Some genotypes including la, lb, 2a, and 2b 

show a broad geographical distribution, and the 

infection frequency of subtypes la and lb can be over 

25 50% in blood donors and patients with chronic hepatitis 
in the United States (Lau, J.Y., et al . , J. Infect. 
Dis. 171; 281-289, 1995), Canada (Bernier, L. , et al . , 
J. Clin. Microbiol. 34: 2815-1818), (Andonov, A., et 
al., J. Clin. Microbiol. 32: 2031-2034, 1994), and most 

30 European countries (Simmonds, P., Hepatology 21: 570- 
582. 1995), (Stuyver, L., Rosseau, et al . , J*. Clin. 
Microbiol. 34: 2259-2266, 1996). Other genotypes, such 
as 3a and 3b are less common than la and lb in those 
countries. However, 3a seems to be quite frequently 

35 found in Canada. Genotypes 4, 5a and 6a are only found 
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in specific geographical' regions in the Middle East, 
South Africa and Hong Kong respectively, but were also 
infrequently found in some areas of Canada. It is 
apparent that for each region a strategy for genotyping 
5 a large number of HCV isolates by PSMEA has to be 
designed, based on the genotype distribution and 
infection frequency with the population. For example, 
in Canada, over 60% of HCV isolates are genotypes la, 
lb, which can be identified by primers 1AB, 1BR. 
10 However, since genotype 6a was frequently found in some 
areas of Canada and primer 1AB was cross reacted with 
6a, thus all isolates diagnosed as la should be 
retested with primer 6AR for screening of genotype 6a 
in the first round of testing with PSMEA. The non- 
15 typeable isolates using the three primers should be 
retested with primers 2A and 2B. Thus genotypes 2a and 
2b (over 15% of total isolates) can be determined in 
the second round of testing. The remaining non- 
typeable isolates should be screened with primers 3A, 
20 3B and 3R. Genotypes 3a and 3b (over 15% of total 
isolates) can be identified in the third round of 
testing. After the three round testing, the rest of 
isolates (less than 10% of total isolates) that cannot 
be typed by these primers would include some subtypes 
25 of type 4, subtype 5a, subtype 2c or other genotypes . 
Table 3 identifies the infection frequencies of the 
major HCV genotypes identified by PSMEA, indicating 
that it is practical and feasible for genotyping a 
large number of isolates for epidemioloy and clinical 
3 0 studies. 

All current genotyping assays such as for HCV, 
including direct DNA sequencing, are not suitable for 
detection of mixed infections because they are designed 
for detection of the population-dominant genotype. As 
35 indicated in Table 3, HCV mixed infection rate was 
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higher than expected. A reliable detection of HCV mixed infections by PSMEA in different 
populations with HCV infection is thus reported. 

EXAMPLE VII 
DETERMINATION OF HIV-1 MUTANTS 

HIV positive sequential plasma samples were collected from patients on the combination 
therapy with RT and protease inhibitors for greater than three months. 

Viral RNA from samples was purified using the QIAamp™ Viral RNA kit. The Pharmacia 
Biotech First-Strand cDNA Synthesis kit was then used for cDNA synthesis. PCR was performed 
to amplify the region containing the entire protease and RT regions of HIV-1. The PCR products 
were purified using the QIA quick™ PCR Purification kit for genotypic analysis with PSMEA, 
direct DNA sequencing and cloning. 

PSMEA was performed with the procedure described above. This assay has been developed 
as a semi- 
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automated system. Accordingly, a CyS.5 dye-labeled 
primer (at the 5' -end of the primer) was used instead 
of a 32 P labeled primer. Primer extension reactions 
contained 20 ng of 5' -end 5.5 dye-labelled primer, 20- 
5 30 ng of PCR product, 20 fiU of each dNTP, 1.25 units of 
pfu DNA polymerase and 10 jiL of 10X pfu reaction buffer 
(PDI) . Primer extensions were performed in a 100 
reaction volume in a thermocycler (Perkin Elmer, 
GeneAmp 9600) . Twenty cycles of 94 °C for 20 seconds, 

10 64 °C for 20 seconds and 72 °C for 35 seconds were 
performed. One microliter of the primer extension 
products was mixed with 1 of the sequencing stop 
solution (Pharmacia Biotech) and electrophoresed on 6% 
polyacryl amide 8M urea TBE mini gels for 10 minutes. 

15 Extension products were analysed by automated DNA 
fragment length polymorphism analysis software in the 
OpenGene™ automated DNA sequencing system (Visible 
Genetics) ♦ 

Both direct DNA sequencing and population based 

20 sequencing were performed with an automated DNA 
sequencer (Visible Genetics) and DNA sequencing kits 
(Visible Genetics) . For the population based DNA 
sequencing, the PCR products were directly cloned into 
the TOPO™ TA vector (Invitrogen Corp.). The TTV 

25 sequence was amplified from positive colonies of 
Escherichia coli using M13 primers (Pharmacia Biotech) . 
The re-amplified PCR products were sequenced using the 
same procedure as used for direct DNA sequencing. 

Based on the principle of PSMEA, primers can be 

3 0 designed to be either universal for both wild type and 
mutants or type-specific (i.e. specific to either wild 
type or mutants) . In addition, many options can be 
explored with different sets of dNTPs and locations of 
primer binding. To test the feasibility of PSMEA for 

35 HIV-1 genotyping, three most common and important 
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mutation sites of codons 82 (V82) , 63 (L63) and 90 
(L90) were detected by PSMEA and direct DNA sequencing. 
Figs. 5A and 5B show an example to indicate the primer 
design and expected base extensions using different 
5 sets of dNTPs for detecting mutation of V82A with a 
universal primer. In Figs/ 5A and 5B, each combination 
of dNTPs results in a different extension profile, 
providing a fingerprint for wild type and mutant. In 
Fig. 5A, extension profile for wild type and mutant are 
10 illustrated. Fig. 5B,. the peak profiles represent the 
number of bases extended as detected by automated DNA 
sequencing using fragment length polymorphism analysis 
software, with the first peak from left representing 
unextended primer. When using specific sets of dNTP's; 
15 ATC, ATG, ACG and TCG, the primer will extend 3, 1, 0 
and 2 bases with wild type. In contrast, extensions of 
0, 1, 5, and 2 bases will occur with mutant. These 
results clearly indicate that the predominant 
population in this sample was V82A mutant, showing a 
20 pattern of 0, 1, 5 and 2 bases extended. However, in 
the presence of ATC, about 12% of the primer extended, 
indicating a low proportion of the wild type virus 
still remained in this patient. Apparently only two 
sets of dNTPs (ATC and TCG) were needed to determine 
25 the levels of mutant ant wild type in a mix. In some 
cases, V82 could mutate to V82I instead of V82A. A 
primer was designed for detection of V82I mutation 
using a set of dNTPs (A, T, and C) . 

It is more difficult to detect drug resistant 
30 mutations at codon 63 than other sites because in this 
case there are 6 combinations of codons for Leucine 
(L63) and 4 codons for Proline (P63) . Since a wild 
type/mutant mix would almost certainly be caused by a 
single base substitution, only four variations should 
35 be expected, with only CTC/CCC (L63) and CTT/CCT 
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combinations seen so far, with the former combination 
being the most common. To overcome this problem, a new 
type of primer was designed for PSMEA to detect 
different drug resistant mutations of L63 codon as 
5 follows: the target mutation point was included as part 
of the 3' end of two primers. The 3 ' end of one primer 
(PROS3L (G) -R) was a perfect match to the wild type, but 
not mutant (Figs. 6A(1) and 6A(2)), while the 3' end of 
the second primer (PR063P (G) -R) was a perfect match to 
10 the nucleotide sequence of mutant, but not wild type 
(Figs. 6A(3) and 6A(4)). In Fig. 6A, the matched 
primers should have 4 and 14 bases extension on both 
L63 and L63P (1) (3) , and the primers with mispairs on 
L63P and L63 were expected to have no extension (2) 
15 (4) . In Fig. 6B, the primer extended either 4 and 14 
bases (second and last peaks from the left) (1) (3), or 
virtually not at all (2) (4), 

If a mismatch occurs on the 3' end of the 
primer, the Pfu will remove the mismatched nucleotides. 
20 A mix of 3 dNTP's was added to the reaction in which 
there are no dNTPs to correct the mismatches on the 3' 
end of the primer. Thus, primer extension occurs under 
these circumstances only if the 3' end of the primer 
was a perfect match to the target mutation point. This 
25 results in very little if any, non specific extension, 
even at annealing temperatures as low as 55°C. When 
dNTPs {A, T and C) were added, the matched primer was 
extended with 5 and 14 bases extensions (Figs. 6A(1) , 
(2), (3) and 6B(1), (3)). However the mismatched primer 
30 could not be extended because the mismatched 
nucleotides at 3' end of the primer were removed with 
no dGTP to repair the mismatch (Figs. 6A(2) , 6A(4) and 
6B(2), (4)). 

Unlike protease codon 63 which has multiple 
35 combinations for L63 mutating to L63P, and codon 82 
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with V82 mutating to V82 1 /A/others , codon 90 has only 
one possible combination for L90 mutating to L90M (AAG 
to ATG) . As a result, only one pair of primers is 
required for detection of this mutation point. Figs. 7A 
5 and 7B show that the primer (PRO90L-R) was designed in 
a reverse direction for detection of L90. Pig. 7A 
illustrates the expected number of bases extended with 
matched and mismatched primers on wild type and mutant 
respectively (1-4). In Fig. 7B, primers PRO90L-R and 
10 PRO90M-F on wild type were extended with 4 on wild type 
(1) and 8 bases on mutant (3) . 

As expected, the primer was extended with four 
bases (Figs. 7A(1) and 7B{1)). Under the same reaction 
conditions, the primer was not extended on mutant L90M 
15 due to the T:T mismatch at the 3 ' end of the primer 
(Figs. 7A(2) and 7B(2)). Another primer (PRO90M-F) was 
designed to detect low levels of drug resistant mutant 
L90M. The primer was extended on L90M mutant with 
eight bases (Figs. 7A(3) and 7B(3)), but not on wild 
2 0 type due to the second to last nucleotide mismatch 
(A:A) at the 3' end of the primer (Figs. 7A(4) and 
7B(4)) . 

In order to assess the sensitivity of PSMEA for 
detection of low levels of mutants in a viral 

25 population, an artificial mixing experiment was carried 
out. PCR products from the protease region of V82 wild 
type and the V82I mutant were mixed in the following 
proportions between mutant and wild type: 3.1% (1/32), 
1.6% (1/64), 0.8% (1/128) and 0.0% (i.e. 100% wild 

30 type). Fig. 8 shows that 0.8% mutant in the mix was 
clearly detected with PSMEA. The levels at 0.8%-3.1% 
were not detected by direct DNA sequencing. It has 
previously been reported that the mixed genotypes or 
quasispecies could be detected only when mixed 
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genotype (s) reached levels over 25% (Deeks SG, Abrams 
DI., Lancet, 349: 1489-1490, 1997). 

To confirm the sensitivity of PSMEA, a total of 
32 isolates from patients being treated with antiviral 
5 drugs have been analysed with both direct DNA 
sequencing and PSMEA for the presence of drug resistant 
mutations in codon V82, L63 and L90 from the protease 
region of HIV-l. Table 4 shows a 100% concordance of 
the genotypic testing results between PSMEA and direct 
10 DNA sequencing for detection of predominant population 
of variants including mutants and wild type. 
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Twenty five (25) (78.1%) of 32 samples determined as being fully wild type (V82) were 
found to contain either mutants V82I or V82A at various levels (1-19%). Different levels (3-27%) 
of L63P mutant were detected by PSMEA in 100% (5/5) of the samples that were determined as 
purely wild type (L63) by direct DNA sequencing. However, in detecting L90M mutation, the 
results showed 86.7 % (13/15) concordance between the two methods. 

One of the samples (AR283) was analyzed using population based sequencing, direct DNA 
sequencing and PSMEA, to further evaluate the accuracy of PSMEA in detecting low levels of drug 
resistant mutants. Table 5 shows that PSMEA demonstrated greater concordance with population 
based sequencing, than direct DNA sequencing for monitoring minor variants. Seventy-six percent 
(76%) and twenty percent (20%) of isolate from AR283 was found to be V82I mutant and wild type 
V82 respectively by PSMEA, but 1 00% mutant V82I by direct DNA sequencing. Population based 
sequencing showed that wild type truly existed at level of 12.5% (4/32). For detection of 
predominant populations variants (mutant V82I and wild type L90) in the isolate, both direct DNA 
sequencing and PSMEA were 100% concordant with population based sequencing. 

From the above, it is thus apparent that PSMEA is more sensitive than direct DNA 
sequencing for detection of low levels of drug resistant mutants. The high degree of sensitivity of 
PSMEA should allow one to more precisely understand how early drug resistant mutations can 
develop, especially during antiretroviral therapy. For clinics, this may further aid in guiding 
clinicians in modulating therapeutic regimens before the appearance of phenotypic drug resistance 
or deterioration in clinical status. With early warning of drug specific mutations, agents can be 
chosen to impede resistance and minimize cross resistance, yet still have an antiviral effect on the 
patient's dominant quasispecies. This in turn will hopefully delay therapeutic failure and increase 
survival. 
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SEQUENCE LISTING 

<110>Hu, Yu-Wen 

<120> PRIMER-SPECIFIC AND MISPAIR EXTENSION ASSAY FOR IDENTIFYING 
GENE VARIATION 
<130> 2883-4757US 
<160> 49 

<170> Patentln version 3.0 
<210> 1 
<211> 25 
<212> DNA 
<213> Artificial 

<220><223> sense universal primer for PCR (first round) 
<400> 1 

ctcccctgtg aggaactact gtctt 25 



<210> 2 
<211> 20 
<212> DNA 
<213> Artificial 

<220x223> antisense universal primer for PCR (first round) 
<400> 2 

ctcgcaagca ccctatcagg 20 



<210> 3 
<211> 31 
<212> DNA 
<213> Artificia 

1<220><223> sense universal primer for PCR (second round) 
<400> 3 

ccatagtggt ctgcggaacc ggtgagtaca c 31 



<210> 4 
<211> 18 
<212> DNA 
<213> Artificial 

<220x223> antisense universal primer for PCR (second round) 



<400> 4 

cccaacacta ctcggcta 



<210> 5 
<211> 21 
<212> DNA 
<213> Artificia 

I<220x223> primer for PSMEA 
<400> 5 

ctcaatgcct ggagatttgg g 



<210> 6 

<211> 20 

<212> DNA 

<213> Artificial 

<220x223> primer for PSMEA 

<400> 6 

caccggaatt gccaggacga 

<210> 7 

<211> 21 

<212> DNA 

<213> Artificial 

<220x223> primer for PSMEA 

<400> 7 

acactactcg gctagcagtc t 



<210> 8 

<211> 21 

<212> DNA 

<213> Artificial 

<220><223> primer for PSMEA 

<400> 8 

ccactctatg cccggtcatt t 



<210> 9 

<211> 21 

<212> DNA 

<213> Artificial 

<220><223> primer for PSMEA 

<400> 9 

tatgtccggt catttgggca c 



<210> 10 

<211> 21 

<212> DNA 

<213> Artificial 

<220x223> primer for PSMEA 

<400> 10 

cactctgtgc ccggccattt g 



<210> 11 

<211> 19 

<212> DNA 

<213> Artificial 

<220><223> primer for PSMEA 

<400> 11 

accggaatcg ctggggtga 

<210> 12 
<211> 19 
<212> DNA 
<213> Artificia 

1<220><223> primer for PSMEA 
<400> 12 

accggaatcg ccgggatga 



<210> 13 
<211> 21 
<212> DNA 
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<213> Artificial 

<220><223> primer for PSMEA 

<400> 13 

cactactcgg ctagtgatct c 21 



<210> 14 
<211> 19 
<212> DNA 
<213> Artificia 

l<220x223> primer for PSMEA 
<400> 14 

gggggtcctg gaggctgtt 



<210> 15 

<211> 22 

<212> DNA 

<213> Artificial 

<220><223> primer for PSMEA 

<400> 15 

cattgagcgg gtttgatcca at 22 



<210> 16 
<211> 12 
<212> DNA 
<213> Artificial 

<220><223> Extension product of wild-type HTV-1 
<400> 16 

agtccacatc ct 12 



<210> 17 
<211> 12 
<212> DNA 
<213> Artificial 

<220><223> Extension product of wild-type fHV-1 
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<400> 17 

cgtccacatc ct 12 



<210> 18 
<211> 41 
<212> DNA 
<213> Artificial 
<220> 

<223> L63 + primer sequences 
<220> 

<221> misc_feature 

<222> (31)..(41) 

<223> n can be any nucleotide 

<400> 18 

gctttatgtc cacagatttc tatgagtacc tnatcatact n 



<210> 19 
<211> 44 
<212> DNA 
<213> Artificial 

<220x223> L63P + primer sequences 
<400> 19 

agacagtatg atcaggtact catagaaatc tgtggacata aagc 



<210> 20 
<211> 26 
<212> DNA 
<213> Artificial 
<220><223> pro63L(G)-R 

<400> 20 

gctttatgtc cacagatttc tatgag 



<210> 21 
<211> 44 
<212> DNA 
<213> Artificial 



<220><223> L63P + primer sequences 



<400> 21 

agacagtatg atcaggtacc catagaaatc tgtggacata aagc 



<210> 22 
<211> 41 
<212> DNA 
<213> Artificial 

<220x223> L63 + primer sequences 

<220x221> misc_feature 

<222> (32)..(41) 

<223> n can be any nucleotide 

<400> 22 

gctttatgtc cacagatttc tatgggtacc tnatcatact n 



<210> 23 
<211> 26 
<212> DNA 
<213> Artificial 
<220><223> PRO90L-R 

<400> 23 

gctttatgtc cacagatttc tatggg 



<210> 24 
<211> 33 
<212> DNA 
<213> Artificial 
<220><223> L90 

<220><221> misc_feature 

<222> (32)..(33) 

<223> n can be any nucleotide 

<400> 24 

atttaaagtg caaccaatct gagtcttcag ann 



<210> 25 
<211> 33 
<212> DNA 
<213> Artificial 
<220x223> L90 

<400> 25 

aatctgaaga ctcagattgg ttgcacttta aat 



<210> 26 
<211> 27 
<2]2> DNA 
<213> Artificial 
<220x223> PRO90L-R 

<400> 26 

atttaaagtg caaccaatct gagtctt 

<210> 27 
<211> 33 
<212> DNA 
<213> Artificial 
<220x223> L90M 

<400> 27 

aatctgatga ctcagattgg ttgcacttta aat 



<210> 28 
<211> 38 
<212> DNA 
<213> Artificial 
<220x223> PRO90M-F 

<220><221> misc_feature 

<222> (37)..(38) 

<223> n can be any nucleotide 

<400> 28 

ctgtcaacat aattggaaga aatctgatga cgcagann 



<210> 29 
<211> 38 
<212> DNA 
<213> Artificial 
<220x223> L90M 

<400> 29 

aatctgcgtc atcagatttc ttccaattat gttgacag 



<210> 30 
<211> 28 
<212> DNA 
<213> Artificial 
<220><223> PRO90M-F 

<400> 30 

ctgtcaacat aattggaaga aatctgat 



<210> 31 
<211> 38 
<212> DNA 
<213> Artificial 
<220><223> L90 

<400> 31 

aatctgagtc aacagatttc ttccaattat gttgacag 



<210> 32 
<211> 13 
<212> DNA 
<213> Artificial 

<220><223> Extension product of a mutant HIV-1 

<220x221> misc_feature 

<222> (12)..(13) 

<223> n can be any nucleotide 



<400> 32 
agttgtatta ann 
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<210> 33 
<2U> 18 
<212> DNA 
<213> Artificial 

<220><223> Extesion product of a wild HIV-1 

<400> 33 
tcaacataat tggaaaga 



<210> 34 

<211> 22 

<212> DNA 

<213> Hepatitis C virus 

<220><221> misc_feature 

<223> nucleotide sequence on templates for HCV genotype la 
<400> 34 

tttgggcgtg cccccgcaag ac 22 



<210> 35 

<211> 22 

<212> DNA 

<213> Hepatitis C virus 

<220><221> misc_feature 

<223> nucleotide sequence on templates for HCV genotype lb 
<400> 35 

tttgggcgtg cccccgcgag ac 22 



<210> 36 

<211> 18 

<212> DNA 

<213> Hepatitis C virus 

<220><221> misc_feature 

<222> (16)..(18) 

<223> N can be any nucleotide 

<220><221> misc_feature 

<223> nucleotide sequence on templates for HCV genotype la 
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<400> 36 

cgcgggggcg cgcccnnn 



<210> 37 

<211> 22 

<212> DNA 

<213> Hepatitis C virus 

<220><221> misc_feature 

<223> nucleotide sequence on templates for HCV genotype la 
<400> 37 

gtcttgcggg ggcacgccca aa 22 



<210> 38 

<211> 13 

<212> DNA 

<213> Hepatitis C virus 

<220x221> misc_feature 

<223> nucleotide sequence on templates for HCV genotype la 
<220x221> misc_feature<222> (12)..(13)<223> n can be any nucleotide 
<400> 38 

cgcgcccccg cnn 13 

<210> 39 

<211> 22 

<212> DNA 

<213> Hepatitis C virus 

<220><221> misc_feature 

<223> nucleotide sequence on templates for HCV genotype lb 
<400> 39 

gtctcgcggg ggcacgccca aa 22 



<210> 40 

<211> 15 

<212> DNA 

<213> Hepatitis C virus 
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<220><221> misc_feature 

<223> nucleotide sequence on templates for HCV genotype lb 

<220><221> misc_feature 

<222> (13)..(15)<223> n can be any nucleotide 

<400> 40 

cgcgcccccg cgnnn 15 



<210> 41 

<211> 22 

<212> DNA 

<213> Hepatitis C virus 

<220x221> misc_feature 

<223> nucleotide sequence on templates for HCV genotype la 
<400> 41 

tttgggcgtg cccccgcaag ac 22 



<210> 42 

<211> 18 

<212> DNA 

<213> Hepatitis C virus 

<220><221> misc_feature 

<223> nucleotide sequence on templates for HCV genotype la 
<400> 42 

cgcgggggcg cgcccaaa 18 



<210> 43 

<211> 22 

<212> DNA 

<213> Hepatitis C virus 

<220x221> misc_feature 

<223> nucleotide sequence on templates for HCV genotype la 

<220x221> misc_feature 

<222> (17)..(22) 

<223> n can be any nucleotide 
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<400> 43 

aggaccgggt cctttcnntt gg 22 



<210> 44 

<211> 22 

<212> DNA 

<213> Hepatitis C virus 

<220><221> misc_feature 

<223> nucleotide sequence on templates for HCV genotype 6a 
<400> 44 

aggaccgggt cctttccatt gg 22 



<210> 45 

<211> 22 

<212> DNA 

<213> Hepatitis C virus 

<220x221> misc_feature 

<223> nucleotide sequence on templates for HCV genotype 6a 
<400> 45 

aggaccgggt cctttccatt gg 22 



<210> 46 

<211> 15 

<212> DNA 

<213> Hepatitis C virus 

<220x221> misc_feature 

<223> nucleotide sequence on templates for HCV genotype 6a 

<220><221> miscjeature 

<222> (15)..(15) 

<223> n can be any nucleotide 

<400> 46 

ggaaagggac ccggn 15 



<210> 47 
<211> 22 
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<212> DNA 

<213> Hepatitis C virus 

<220x221> misc_Jeature 

<223> nucleotide sequence on templates for HCV genotype la 

<220><221> misc_feature 

<222> (17)..(22) 

<223> n can be any nucleotide 

<400> 47 

aggaccgggt cctttcnntt gg 22 



<210> 48 

<211> 48 

<212> DNA 

<213> Hepatitis C virus 

<220><221> misc_feature 

<223> nucleotide sequence on templates for HCV genotype 2a 
<400> 48 

tggccgggca tagagtgggt ttatccaaga aaggacccag tcttcccg 48 



<210> 49 

<211> 48 

<212> DNA 

<213> Hepatitis C virus 

<220><221> misc_feature< 

223> nucleotide sequence on templates for HCV genotype lb 
<400> 49 

tctccaggca ttgagcgggt tgatccaaga aaggacccgg tcgtcctg 48 
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Table 3 

The prevalence of HCV mixed genotype infections in different populations determined by 
PSMEA and direct DNA sequencing 





PSMEA 


Direct DNA sequencing 


Blood donors 


7/72 


ND 




(9.7%) 




Patients with chronic 


15/80 


8/80 


hepatitis C 


(18.8%) 


(10.0%)* 


Thalassemia patients 


6/36 


4/36 




(16.7%) 


(11.1%) 



# The samples determined to contain more than one genotype by direct DNA sequencing were 100% 
concordant with that determined by PSMEA. Since direct the DNA sequencing method is not sensitive 
enough to detect low levels (<25%) of mixed genotype infections, the percentage of mixed genotype 
infections determined by direct DNA sequencing is lower than what was found by PSMEA. 
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Tahle.4 . 

Comparison of the sensitivity between direct DNA 
sequencing and PSMEA for detecting low levels of drug 

resistant mutations 



i 


Direct DNA Sequencing 


PSMEA 




L63/P/C/ 


V82/I/A 


L90/M 


L63 


L63P/ 


V82/V82IA/82A 


L90rt_90M 


Sample 


L63T 


(W/M) 


(W/M) 


(%) 


L63C 




(%) 




(%) 


ID 


(W/M) 




















4B 


L63 


V82 


L90 






Q1 

v i 


7 








6B 


LUur 






*i 


97P 


SO 


n 








7B 


L63C 


V82 


L90 






OO 


I** 


o 
o 






8B 


L.OOr 


VOi 




£ 


jOr 


on 
oa 


n 
O 


o 

0 






GL 


l COD 










97 


<1 


2 


7 


93 


JB 


t coo 


\/QO A A/ 

VoZA/V 


LyUM/L 






31 


15 


54 


56 


44 


MN 


LB3P 


\ /on a 

V82A 


L90M 


0 


100P 


22 


10 


68 


5 


95 


CA 


L63P 


V82 


L90 


0 


100P 


89 


9 


2 


100 


0 


DS1 


L63P 


V82 


L90 






86 


12 


2 






DS2 


L63P 


V82 


L90 


3 


97P 


95 


3 


2 






GP 


L63P 


V82 


L90 






91 


3 


6 


100 


0 


AS 




V82A 








14 


4 


82 






FP 


L63P 


V82 


L90 






90 


7 


3 


100 


0 


M15-8 


L63C 


V82 


L90 


49 


51C 


93 


5 


2 






M15-10 


L63 


V82 


L90 


73 


27C 


90 


6 


4 






M17-17 


L63C 


V82V/I 


L90 


50 


50C 


71 


27 


2 






M17-22 


L63C 


V82 


L90 


48 


52C 


90 


8 


A A 

10 






M 17-25 


L.DO 






OO 




54 


17 


29 






M17-33 


L63T 


V82 


L90 






84 


14 


2 






M17-35 


L63T 


V82 


L90 






81 


16 


3 






M17-37 


L63 


V82 


L90 


94 


6C 


79 


19 


2 






M17-41 


L63 


V82 


L90 


97 


3C 


93 


4 


3 






M 17-44 


L63P 


V82 


L90M 






93 


5 


2 






98027 


L63 


V82 


L90 


96 


4P 


89 


9 


2 


99 


1 


V894713 


L63 


V82 


L90 






89 


9 


2 


100 


0 


AR283 


L63P 


V82I 


L90 






20 


76 


4 


100 


0 


AR929 


L63P 


V821/V 


L90 






33 


65 


2 


100 


0 


AR2429 


L63P 


V82V/I 


L90 






39 


60 


1 


100 


0 


AP6423 


L63L7P 


V82 


L90 






90 


8 


2 


100 


0 


AP6617 


L63P 


V82 


L90 






89 


10 


1 


100 


0 


AP7010 


L63P 


V82 


L90 






90 


9 


1 


100 


0 


AP7366 


L63P 


V82 


L90 






90 


8 


2 


100 


0 



The percentage of mutant in a mix is calculated as follows: 
% mutant = (% wild type extension + % mutant extension) x 100 



Twenty five (25) (78.1%) of 32 samples 
determined as being fully wild type (V82) were found to 
contain either mutants V82I or V82A at various levels 
(1-19%). Different levels (3-27%) of L63P mutant were 
detected by PSMEA in 100% (5/5) of the samples that 
were determined as purely wild type (L63) by direct DNA 
sequencing- However, in detecting L90M mutation, the 
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Table 5 

Evaluation of the accuracy of PSMEA and direct sequencing for detection of low levels 
of variants with population based sequencing 



Method 



Population based 
sequencing 

Direct DNA 
sequencing 

PSMEA 



Genotypes 

%V821 %V82 %L90M 

87.5(28/32)* 12.5 (4/32) 0.0 (0/32) 



100.0 



76.0 



0.0 
20.0 



0.0 



0.0 



%L90 

100.0 (32/32) 

100.0 

100.0 



* Denotes that a total number of mutant or wild type found in 32 cloned PCR products 
sequenced. 



